Using Stroke-Number-Characteristics for Improving Efficiency of Combined Online and Offline Japanese Character Classifiers
نویسندگان
چکیده
We propose a new technique for normalizing likelihood of multiple classifiers prior to their combination. During a combination process we utilize the information about their efficiency correctly recognize a character with a given stroke number. In the beginning, we show that this recognizer’s efficiency based on a stroke number is different for a common on-line and off-line recognizer. Later, we demonstrate on elementary combination rules, such as sum-rule and max-rule that using this information increases a recognition rate.
منابع مشابه
A new warping technique for normalizing likelihood of multiple classifiers and its effectiveness in combined on-line/off-line japanese character recognition
We propose a new technique for normalizing likelihood of multiple classifiers prior to their combination. Our technique takes classifier-specific likelihood characteristics into account and maps them to a common, ideal characteristic allowing fair combination under arbitrary combination schemes. For each classifier, a simple warping process aligns the likelihood with the accumulated recognition...
متن کاملVirtual Example Synthesis Based on PCA for Off-Line Handwritten Character Recognition
This paper proposes a method to improve off-line character classifiers learned from examples using virtual examples synthesized from an on-line character database. To obtain good classifiers, a large database which contains a large enough number of variations of handwritten characters is usually required. However, in practice, collecting enough data is time-consuming and costly. In this paper, ...
متن کاملThe Impact of Large Training Sets on the Recognition Rate of Offline Japanese Kanji Character Classifiers
Though it is commonly agreed that increasing the training set size leads to improved recognition rates, the deficit of publicly available Japanese character pattern databases prevents us from verifying this assumption empirically for large data sets. Whereas the typical number of training samples has usually been between 100-200 patterns per category until now, newly collected databases and inc...
متن کاملAccumulated-Recognition-Rate Normalization for Combining Multiple On/Off-Line Japanese Character Classifiers Tested on a Large Database
This paper presents a technique for normalizing likelihood of multiple classifiers, allowing their fair combination. Our technique generates for each recognizer one general or several stroke-number specific characteristic functions. A simple warping process maps output scores into an ideal characteristic. A novelty of our approach is in using a characteristic based on the accumulated recognitio...
متن کاملبازشناسی برخط حروف مجزای دستنویس فارسی بر اساس تشخیص گروه بدنه اصلی با استفاده از ماشین بردار پشتیبان
In this paper a new method for the online recognition of handwritten Persian characters has been proposed which uses a set of simple features and Support Vector Machine (SVM) as a classifier. The task of preprocessing allows us to equalize feature vectors from different characters. This algorithm is implemented in two steps. In the first step, input character is classified into one of eighteen ...
متن کامل